Edge - Based Best - First Chart

نویسندگان

  • Eugene Charniak
  • Sharon Goldwater
چکیده

Best-rst probabilistic chart parsing attempts to parse eeciently by working on edges that are judged \best" by some probabilistic gure of merit (FOM). Recent work has used probabilistic context-free grammars (PCFGs) to assign probabilities to constituents, and to use these probabilities as the starting point for the FOM. This paper extends this approach to using a probabilistic FOM to judge edges (incomplete constituents), thereby giving a much ner-grained control over parsing eeort. We show how this can be accomplished in a particularly simple way using the common idea of binarizing the PCFG. The results obtained are about a factor of twenty improvement over the best prior results | that is, our parser achieves equivalent results using one twentieth the number of edges. Furthermore we show that this improvement is obtained with parsing precision and recall levels superior to those achieved by exhaustive parsing. Abstract Best-rst probabilistic chart parsing attempts to parse eeciently by working on edges that are judged \best" by some probabilistic gure of merit (FOM). Recent work has used probabilistic context-free grammars (PCFGs) to assign probabilities to constituents, and to use these probabilities as the starting point for the FOM. This paper extends this approach to using a probabilistic FOM to judge edges (incomplete constituents), thereby giving a much ner-grained control over parsing eeort. We show how this can be accomplished in a particularly simple way using the common idea of binarizing the PCFG. The results obtained are about a factor of twenty improvement over the best prior results | that is, our parser achieves equivalent results using one twentieth the number of edges. Furthermore we show that this improvement is obtained with parsing precision and recall levels superior to those achieved by exhaustive parsing.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Determining the Pollinizer for Pecan Cultivars

This study was conducted to determine the best pollinizer for five selected pecan cultivars in southwest of Iran at Safiabad Agricultural Research Center in 2014-2015. The cultivars included: 'GraTex', '10J', 'Wichita 6J', 'GraKing', 'Choctaw' as pollinated cultivars (♀) and 'GraTex', 'Peruque', 'Comanche 4M', '10J', 'Wichita 6J', 'Mohawk', 'Mahan', 'Stuart 2J', '3J', 'Stuart 4J', 'GraKing', 'C...

متن کامل

A Robust Dispersion Control Chart Based on M-estimate

Process control charts are proven techniques for improving quality. Specifying the control limits is the most important step in designing a control chart. The presence of outliers may extremely affect the estimates of parameters using classical methods. Robust estimators which are not affected by outliers or the small departures from the model assumptions are applied in this paper to specify th...

متن کامل

Edge-Based Best-First Chart Parsing

Best-first probabilistic chart parsing attempts to parse efficiently by working on edges that are judged ~'best" by some probabilistic figure of merit (FOM). Recent work has used probabilistic context-free grammars (PCFGs) to assign probabilities to constituents, and to use these probabilities as the starting point for the FOM. This paper extends this approach to using a probabilistic FOM to ju...

متن کامل

Some results on vertex-edge Wiener polynomials and indices of graphs

The vertex-edge Wiener polynomials of a simple connected graph are defined based on the distances between vertices and edges of that graph. The first derivative of these polynomials at one are called the vertex-edge Wiener indices. In this paper, we express some basic properties of the first and second vertex-edge Wiener polynomials of simple connected graphs and compare the first and second ve...

متن کامل

A Tabulation-Based Parsing Method that Reduces Copying

This paper presents a new bottom-up chart parsing algorithm for Prolog along with a compilation procedure that reduces the amount of copying at run-time to a constant number (2) per edge. It has applications to unification-based grammars with very large partially ordered categories, in which copying is expensive, and can facilitate the use of more sophisticated indexing strategies for retrievin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998